A Rule-Augmented Statistical Phrase-based Translation System
نویسندگان
چکیده
Interactive or Incremental Statistical Machine Translation (IMT) aims to provide a mechanism that allows the statistical models involved in the translation process to be incrementally updated and improved. The source of knowledge normally comes from users who either post-edit the entire translation or just provide the translations for wrongly translated domain-specific terminologies. Most of the existing work on IMT uses batch learning paradigm which does not allow translation systems to make use of the new input instantaneously. We introduce an adaptive MT framework with a Rule Definition Language (RDL) for users to amend MT results through translation rules or patterns. Experimental results show that our system acknowledges user feedback via RDL which improves the translations of the baseline system on three test sets for Vietnamese to English translation.
منابع مشابه
Enriching a statistical machine translation system trained on small parallel corpora with rule-based bilingual phrases
In this paper, we present a new hybridisation approach consisting of enriching the phrase table of a phrase-based statistical machine translation system with bilingual phrase pairs matching structural transfer rules and dictionary entries from a shallowtransfer rule-based machine translation system. We have tested this approach on different small parallel corpora scenarios, where pure statistic...
متن کاملThe ISL Phrase-Based MT System for the 2007 ACL Workshop on Statistical Machine Translation
In this paper we describe the Interactive Systems Laboratories (ISL) phrase-based machine translation system used in the shared task ”Machine Translation for European Languages” of the ACL 2007 Workshop on Statistical Machine Translation. We present results for a system combination of the ISL syntax-augmented MT system and the ISL phrase-based system by combining and rescoring the n-best lists ...
متن کاملNTT System Description for the WMT2006 Shared Task
We present two translation systems experimented for the shared-task of “Workshop on Statistical Machine Translation,” a phrase-based model and a hierarchical phrase-based model. The former uses a phrasal unit for translation, whereas the latter is conceptualized as a synchronousCFG in which phrases are hierarchically combined using non-terminals. Experiments showed that the hierarchical phraseb...
متن کاملA Lexicalized Reordering Model for Hierarchical Phrase-based Translation
Lexicalized reordering model plays a central role in phrase-based statistical machine translation systems. The reordering model specifies the orientation for each phrase and calculates its probability conditioned on the phrase. In this paper, we describe the necessity and the challenge of introducing such a reordering model for hierarchical phrase-based translation. To deal with the challenge, ...
متن کاملThe Universität Karlsruhe Translation System for the EACL-WMT 2009
In this paper we describe the statistical machine translation system of the Universität Karlsruhe developed for the translation task of the Fourth Workshop on Statistical Machine Translation. The state-ofthe-art phrase-based SMT system is augmented with alternative word reordering and alignment mechanisms as well as optional phrase table modifications. We participate in the constrained conditio...
متن کاملSyntax-Directed Phrase-based Statistical Machine Translation
The SDTS has been applied in the field of compiler and in transfer-based machine translation. After the parsing step, the syntactic structure of a sentence is identified. The parse tree will be analyzed, augmented, and transformed by later phases in the SMT system. Those phases are controlled by syntax. We use the stochastic SDTS to model such kind of translation process for phrase-based SMT. T...
متن کامل